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RAW SEQUENCE LISTING 
ERROR REPORT 



BIOTECHNOLOGY <§£ < g> jg - 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 




TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4,2 PROGRAM ACCESSIBLE THROUGH THE U S PATENT AND 
TRADEMARK OEEICE WEBSITE SEE BELOW FOR ADDRESS • - . ’■ 

http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for die 
United Slates Patent and Trademark Office, and instead should be sent via the following to tire indicated addresses: 

l EFS-Bio (<http://www. usoto.gov/cbc/cfs/downlonds/documcnts.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 06/05/04): 

U.S. Patent and Trademark Oflice, 220 20 th Street S., Customer Window, Mail Stop Sequence, Crystal Plaza Two, Lobby, 
- Room 1B03, Arlington, VA 22202 ’ - - • 



Revised 05/17/04 




Raw Sequence Listin g Error Summap' 

ATTN: NEW RULES CASES: PLEASE DISRCQARD ENGLISH “ALPHA** HEADER WHICH WERE INSERTED BY PTO SOFTWARE 

% 

I Wrapped Nucteics The numbcr/iext al the end of each line “wrapped** down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word prdeessor after creating it. Please adjust your right margin to .3; this will 
prevent “wrapping.” 

2 Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

3 ^Misaligned Amino The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 

Numbering use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules Please 

ensure your subsequent submission Is saved in ASCII text. 

> 

Sequence(s) contain n's or Xaa’s representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing 

6 Patcntln 2.0 A “bug** in Patcntln version 2.0 has caused the <220>-<223> section to be missing from ammo acid 

“bug” scqucnccs(s) . Normally, Patcntln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. Tli is applies to (lie mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



4 Non-ASCII 

5 Variable Length 



ERROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER:.! 



Skipped Sequences 
"(OLD RULES) 



Scquence(s) 



missing. If intentional, please insert the following lines for each skipped sequence 



(2) INFORMATION FOR SCQ ID NO X (insert SCQ ID NO where “X” is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insen any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION SEQ ID NO X: (insert $EQ ID NO where “X” is shown) 

This sequence is intentionally skipped * 

Please also adjust the “(u) NUMBER OF SEQUENCES :** response to include the skipped sequences 
missing If intentional, please insert the following lines for each skipped sequent 



Skipped Sequences Scqucnce(s) 

(NEW RULES) <2 1 0> sequence id number 
<400> sequence id number 
000 



or Xaa’s Use of n*s and/or Xaa’s have been detected in the Sequence Listing. 

RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n’s or Xaa's arc present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents 




Invalid <2 1 3> 
Response 



Per 1 .823 of Sequence Rules, t he only val id <2 1 3> resp onses arc : Un known , Artificial Sequence, or 
sc ientific nam e (Gcnus/spccics) <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Scoucncc y 



12 



Use of <220> 



Patentln 2.0 
“bug” 



Scqucncc(s) 



missing the <220> “Feature” and associated numeric identifiers a,. d responses. 



Use of <220> to <22 3> is MANDATORY if <2 1 3> “Organism” responses "Artificial Sequence” or 
“Unknown.” Please explain source of genetic material in <J20> to <223> section. 

(See “Federal Register, ’* OtfOl/1998, Vol. 63. No. 104, pp. 2963102) (See. 1 .823 of Sequence Rules) 

Please do not use “Copy to Disk" function of Patcntln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use “File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa “n" can only represent a single nucleotide ; “Xaa" can only represent a single amino acid 



AMC- Biotechnology Systems Branch - 09/09/2003 



IFWO 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/787,267 



DATE: 07/20/2004 
TIME: 11:34:22 



4 

5 

6 
7 
9 

12 

14 

15 

17 

18 
20 
21 

23 

24 
26 
28 

30 

31 

32 

33 

35 

36 

37 

39 

40 



Input Set : A:\GC687-3-Dl-seqlist.txt 
Output Set : N:\CRF4\07202004\J787267.raw 

<110> APPLICANT: Dartois, Veronique A. 

Hoch, James A. 

Valle, Fernando 
Kumar, Manoj 

<12 0 > TITLE OF INVENTION: 2,5-DKG Permeases 
<130> FILE REFERENCE: GC687-3-D1 

<140> CURRENT APPLICATION NUMBER: US 10/787,267 
<141> CURRENT FILING DATE: 2004-02-25 
<150> PRIOR APPLICATION NUMBER: US 09/922,501 
<151> PRIOR FILING DATE: 2001-08-03 
<150> PRIOR APPLICATION NUMBER: US 60/325,774 
<151> PRIOR FILING DATE: 2000-08-04 
<150> PRIOR APPLICATION NUMBER: US 60/421,141 
<151> PRIOR FILING DATE: 2000-09-29 
<160> NUMBER OF SEQ ID NOS: 22 

< 170 > SOFTWARE: FastSEQ for Windows Version 4^ 

SEQ ID NO: 1 
LENGTH: 1500 
TYPE : DNA 

j 

environmental source 













CM<-/unk«0‘» 1 '^§ r 



<210 > 
< 211 > 
<212 > 
<2 13 > 
<22 0 > 
<22 1 > 
<222 > 
<400 > 



ORGANISM: 
FEATURE : 
NAME /KEY 
LOCATION 
SEQUENCE 



'MeaCe. i # (2 




CDS 

(94) . . . (1374) 

1 

ggcgaatagc ccggccggcg tcataataac ggccttctct gtaccctaca tacggcggcg 

41 gcgtcatgaa cctcaacttt agtaggcaag cct atg aac age tet acc aat gca 

42 Met Asn Ser Ser Thr Asn Ala 

43 1 5 



- r* 37 . ■ 

OK\ C ^ K ^ 

60 r Y 

ii4 ^UuLT' 



45 


aeg 


aaa 


ege 


tgg 


tgg 


tac 


ate 


atg 


cct 


ate 


gtg 


ttt 


ate 


aeg 


tat 


age 


162 


46 


Thr 


Lys 


Arg 


Trp 


Trp 


Tyr 


lie 


Met 


Pro 


lie 


Val 


Phe 


lie 


Thr 


Tyr 


Ser 




47 






10 










15 










20 










49 


ctg 


geg 


tat 


etc 


gac 


ege 


gca 


aac 


ttc 


age 


ttt 


get 


teg 


gca 


geg 


ggc 


210 


50 


Leu 


Ala 


Tyr 


Leu 


Asp 


Arg 


Ala 


Asn 


Phe 


Ser 


Phe 


Ala 


Ser 


Ala 


Ala 


Gly 




51 




25 










30 










35 












53 


att 


aeg 


gaa 


gat 


tta 


ggc 


att 


acc 


aaa 


ggc 


ate 


teg 


teg 


ett 


ett 


ggc 


258 


54 


lie 


Thr 


Glu 


Asp 


Leu 


Gly 


He 


Thr 


Lys 


Gly 


lie 


Ser 


Ser 


Leu 


Leu 


Gly 




55 


40 










45 










50 










55 




57 


gca 


ett 


ttc 


ttc 


etc 


ggc 


tat 


ttc 


ttc 


ttc 


cag 


ate 


ccg 


ggg 


geg 


att 


306 


58 


Ala 


Leu 


Phe 


Phe 


Leu 


Gly 


Tyr 


Phe 


Phe 


Phe 


Gin 


lie 


Pro 


Gly 


Ala 


lie 




59 










60 










65 










70 






61 


tac 


geg 


gaa 


ege 


cgt 


age 


gta 


egg 


aag 


ctg 


att 


ttc 


ate 


tgt 


ctg 


ate 


354 


62 


Tyr 


Ala 


Glu 


Arg 


Arg 


Ser 


Val 


Arg 


Lys 


Leu 


lie 


Phe 


lie 


Cys 


Leu 


lie 




63 








75 










80 










85 









file://C:\CRF4\Outhold\VsrJ787267.htm 



7/20/04 





Page 2 of 7 



RAW SEQUENCE LISTING DATE: 07/20/2004 

PATENT APPLICATION: US/10/787,267 TIME: 11:34:22 

Input Set : A:\GC687-3-Dl-seqlist.txt 









Output 


Set: 


N:\CRF4\07202004\J787267.raw 








65 ctg 


tgg 


ggc 


gcc 


tgc 


gcc 


teg 


ett 


gac 


egg 


gat 


ggt 


gca 


caa 


tat 


tcc 


402 


66 Leu 


Trp 


Gly Ala 


Cys 


Ala 


Ser 


Leu 


Asp 


Arg 


Asp 


Gly 


Ala 


Gin 


Tyr 


Ser 




67 




90 










95 










100 










6 9 age 


get 


ggc 


tgg 


ega 


tcc 


gtt 


tta 


ttc 


teg 


get 


gtc 


gtg 


gaa 


geg 


geg 


450 


70 Ser 


Ala 


Gly 


Trp 


Arg 


Ser 


Val 


Leu 


Phe 


Ser 


Ala 


Val 


Val 


Glu 


Ala 


Ala 




71 


105 










110 










115 












73 gtc 


atg 


ccg 


geg 


atg 


ctg 


att 


tac 


ate 


agt 


aac 


tgg 


ttt 


acc 


aaa 


tea 


498 


74 Val 


Met 


Pro 


Ala 


Met 


Leu 


lie 


Tyr 


lie 


Ser 


Asn 


Trp 


Phe 


Thr 


Lys 


Ser 




75 120 










125 










130 










135 




7 7 gaa 


cgt 


tea 


ege 


gcc 


aac 


acc 


ttc 


tta 


ate 


etc 


ggc 


aac 


ccg 


gtc 


aeg 


546 


78 Glu 


Arg 


Ser 


Arg 


Ala 


Asn 


Thr 


Phe 


Leu 


lie 


Leu 


Gly 


Asn 


Pro 


Val 


Thr 




79 








140 










145 










150 






81 gta 


ctg 


tgg 


atg 


teg 


gtg 


gtc 


tcc 


ggc 


tac 


ctg 


att 


cag 


tcc 


ttc 


ggc 


594 


82 Val 


Leu 


Trp 


Met 


Ser 


Val 


Val 


Ser 


Gly 


Tyr 


Leu 


lie 


Gin 


Ser 


Phe 


Gly 




83 






155 










160 










165 








85 tgg 


cgt 


gaa 


atg 


ttt 


att 


att 


gaa 


ggc 


gtt 


ccg 


gcc 


gtc 


etc 


tgg 


gee 


642 


86 Trp 


Arg 


Glu 


Met 


Phe 


lie 


lie 


Glu 


Gly 


Val 


Pro 


Ala 


Val 


Leu 


Trp 


Ala 




87 




170 










175 










180 










89 ttc 


tgc 


tgg 


tgg 


gtg 


ctg 


gtc 


aaa 


gtt 


aaa 


ccg 


teg 


cag 


gtg 


aac 


tgg 


690 


90 Phe 


Cys 


Trp 


Trp 


Val 


Leu 


Val 


Lys 


Val 


Lys 


Pro 


Ser 


Gin 


Val 


Asn 


Trp 




91 


185 










190 










195 












93 ttg 


teg 


gaa 


aac 


gag 


aaa 


gcc 


geg 


ctg 


cag 


geg 


cag 


ctg 


gag 


age 


gag 


738 


94 Leu 


Ser 


Glu 


Asn 


Glu 


Lys 


Ala 


Ala 


Leu 


Gin 


Ala 


Gin 


Leu 


Glu 


Ser 


Glu 




95 200 










205 










210 










215 




97 cag 


cag 


ggt 


att 


aaa 


gcc 


gtg 


cgt 


aac 


tac 


ggc 


gaa 


gcc 


ttc 


ege 


tea 


786 


98 Gin 


Gin 


Gly 


He 


Lys 


Ala 


Val 


Arg 


Asn 


Tyr 


Gly 


Glu 


Ala 


Phe 


Arg 


Ser 




99 








220 










225 










230 






101 ege 


aac 


gtc 


att 


eta 


ctg 


tgc 


atg 


cag 


tat 


ttt 


gcc 


tgg 


agt 


ate 


ggc 


834 


102 Arg 


Asn 


Val 


lie 


Leu 


Leu 


Cys 


Met 


Gin 


Tyr 


Phe 


Ala 


Trp 


Ser 


lie 


Gly 




103 






235 










240 










245 








105 gtg 


tac 


ggt 


ttt 


gtg 


ctg 


tgg 


ttg 


ccg 


tea 


att 


att 


ege 


age 


ggc 


ggc 


882 


106 Val 


Tyr 


Gly 


Phe 


Val 


Leu 


Trp 


Leu 


Pro 


Ser 


lie 


lie 


Arg 


Ser 


Gly 


Gly 




107 




250 










255 










260 










109 gtc 


aat 


atg 


ggg 


atg 


gtg 


gaa 


gtc 


ggc 


tgg 


etc 


tet 


teg 


gtg 


cct 


tat 


930 


110 Val 


Asn 


. Met 


Gly 


Met 


val 


Glu 


Val 


Gly 


Trp 


Leu 


Ser 


Ser 


Val 


Pro 


Tyr 




111 


265 










270 










275 












113 ctg 


gcc 


geg 


act 


att 


geg 


atg 


ate 


gtc 


gtc 


tcc 


tgg 


get 


tcc 


gat 


aaa 


978 


114 Leu 


Ala 


Ala 


Thr 


lie 


Ala 


Met 


lie 


Val 


Val 


Ser 


Trp 


Ala 


Ser 


Asp 


Lys 




115 280 










285 










290 










295 




117 atg 


cag 


aac 


cgt 


aaa 


ctg 


ttc 


gtc 


tgg 


ccg 


ctg 


ctg 


ctg 


att 


ggc 


gga 


1026 


118 Met 


Gin 


. Asn 


Arg 


Lys 


Leu 


Phe 


Val 


Trp 


Pro 


Leu 


Leu 


Leu 


lie 


Gly 


Gly 




119 








300 










305 










310 






121 ctg 


get 


ttt 


att 


ggc 


tea 


tgg 


gcc 


gtc 


ggc 


get 


aac 


cat 


ttc 


tgg 


gee 


1074 


122 Leu 


Ala 


Phe 


lie 


Gly 


Ser 


Trp 


Ala 


Val 


Gly 


Ala 


Asn 


His 


Phe 


Trp 


Ala 




123 






315 










320 










325 








125 tet 


tat 


a cc 


ctg 


ctg 


gtg 


att 


gcc 


aat 


geg 


gca 


atg 


tac 


gcc 


cct 


tac 


1122 


126 Ser 


Tyr 


Thr 


Leu 


Leu 


Val 


lie 


Ala 


Asn 


Ala 


Ala 


Met 


Tyr 


Ala 


Pro 


Tyr 




127 




330 










335 










340 










129 ggt 


ccg 


ttt 


ttc 


gcc 


ate 


att 


ccg 


gaa 


atg 


ctg 


ccg 


cgt 


aac 


gtc 


gee 


1170 



file://C:\CRF4\Outhold\VsrJ787267.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/787,267 



DATE: 07/20/2004 

TIME: 11:34:22 



Input Set : A:\GC687-3-Dl-seqlist.txt 
Output Set: N:\CRF4\07202004\J787267.raw 

130 Gly Pro Phe Phe Ala lie lie Pro Glu Met Leu Pro Arg Asn Val Ala 

131 345 350 355 

133 ggt ggc gca atg gcg etc ate aac age atg ggg gee tta ggt tea ttc 

134 Gly Gly Ala Met Ala Leu lie Asn Ser Met Gly Ala Leu Gly Ser Phe 

135 360 365 370 . 375 

137 ttt ggt teg tgg ttc gtg ggc tac etg aac ggc acc acc ggc agt cca 

138 Phe Gly Ser Trp Phe Val Gly Tyr Leu Asn Gly Thr Thr Gly Ser Pro 

139 380 385 390 

141 tea gcc tea tac att ttc atg gga gtg gcg ett ttc gcc teg gta tgg 

142 Ser Ala Ser Tyr He Phe Met Gly Val Ala Leu Phe Ala Ser Val Trp 



\p \ -c cx-^- 



145 ett act tta att gtt aag cct get aac aat caa aag etc ccc ate ggc 

146 Leu Thr Leu lie Val Lys Pro Ala Asn Asn Gin Lys Leu Pro lie Gly 

147 410 415 420 

149 get cgt cac gcc tgacctttac taettaegga gatcacgcct tgggtacgtt 

150 Ala Arg His Ala 

151 425 

153 gcaggacaaa ccgataggca ccgcaaaggc tggggccatc gageagegeg taaacagtca 

154 gctggttgct gtcgctgtgc ggcgtc 

156 <210> SEQ ID NO: 2 

157 <2 11> LENGTH: 427 / 

158 <2 12 > TYPE: PRT^-* ^ 

15 9 <213 > ORGANISM: (environmental sour^fe ^ ^^; ^ <T jp_Q_ 

161 <400 > SEQUENCE: 2 

162 Met Asn Ser Ser Thr Asn Ala Thr Lys Arg Trp Trp Tyr lie Met Pro ^ 

163 15 10 15 

164 lie Val Phe lie Thr Tyr Ser Leu Ala Tyr Leu Asp Arg Ala Asn Phe 

165 20 25 30 

166 Ser Phe Ala Ser Ala Ala Gly lie Thr Glu Asp Leu Gly lie Thr Lys 

167 35 40 45 

168 Gly lie Ser Ser Leu Leu Gly Ala Leu Phe Phe Leu Gly Tyr Phe Phe 

169 50 55 60 

170 Phe Gin lie Pro Gly Ala lie Tyr Ala Glu Arg Arg Ser Val Arg Lys 

171 65 70 75 80 

172 Leu lie Phe lie Cys Leu lie Leu Trp Gly Ala Cys Ala Ser Leu Asp 

173 85 90 95 

174 Arg Asp Gly Ala Gin Tyr Ser Ser Ala Gly Trp Arg Ser Val Leu Phe 

175 100 105 110 

176 Ser Ala Val Val Glu Ala Ala Val Met Pro Ala Met Leu lie Tyr lie 

177 115 120 125 

178 Ser Asn Trp Phe Thr Lys Ser Glu Arg Ser Arg Ala Asn Thr Phe Leu 

179 130 135 140 

180 lie Leu Gly Asn Pro Val Thr Val Leu Trp Met Ser Val Val Ser Gly 

181 145 150 155 160 

182 Tyr Leu lie Gin Ser Phe Gly Trp Arg Glu Met Phe lie lie Glu Gly 

183 165 170 175 

184 Val Pro Ala Val Leu Trp Ala Phe Cys Trp Trp Val Leu Val Lys Val 

185 180 185 190 

186 Lys Pro Ser Gin Val Asn Trp Leu Ser Glu Asn Glu Lys Ala Ala Leu 



Or\ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/787,267 



DATE: 07/20/2004 

TIME: 11:34:22 



Input Set : A:\GC687-3-Dl-seqlist.txt 
Output Set: N:\CRF4\07202004\J787267.raw 



187 






195 










200 










205 








188 


Gin 


Ala 


Gin 


Leu 


Glu 


Ser 


Glu 


Gin 


Gin 


Gly 


lie 


Lys 


Ala 


Val 


Arg 


Asn 


189 




210 










215 










220 










190 


Tyr 


Gly 


Glu 


Ala 


Phe 


Arg 


Ser 


Arg 


Asn 


Val 


lie 


Leu 


Leu 


Cys 


Met 


Gin 


191 


225 










230 










235 










240 


192 


Tyr 


Phe 


Ala 


Trp 


Ser 


lie 


Gly 


Val 


Tyr 


Gly 


Phe 


Val 


Leu 


Trp 


Leu 


Pro 


193 










245 










250 










255 




194 


Ser 


He 


lie 


Arg 


Ser 


Gly 


Gly 


Val 


Asn 


Met 


Gly 


Met 


Val 


Glu 


Val 


Gly 


195 








260 










265 










270 






196 


Trp 


Leu 


Ser 


Ser 


Val 


Pro 


Tyr 


Leu 


Ala 


Ala 


Thr 


lie 


Ala 


Met 


lie 


Val 


197 






275 










280 










285 








198 


Val 


Ser 


Trp 


Ala 


Ser 


Asp 


Lys 


Met 


Gin 


Asn 


Arg 


Lys 


Leu 


Phe 


Val 


Trp 


199 




290 










295 










300 










200 


Pro 


Leu 


Leu 


Leu 


lie 


Gly 


Gly 


Leu 


Ala 


Phe 


lie 


Gly 


Ser 


Trp 


Ala 


Val 


201 


305 










310 










315 










320 


202 


Gly 


Ala 


Asn 


His 


Phe 


Trp 


Ala 


Ser 


Tyr 


Thr 


Leu 


Leu 


Val 


lie 


Ala 


Asn 


203 










325 










330 










335 




204 


Ala 


Ala 


Met 


Tyr 


Ala 


Pro 


Tyr 


Gly 


Pro 


Phe 


Phe 


Ala 


lie 


lie 


Pro 


Glu 


205 








340 










345 










350 






206 


Met 


Leu 


Pro 


Arg 


Asn 


Val 


Ala 


Gly 


Gly 


Ala 


Met 


Ala 


Leu 


lie 


Asn 


Ser 


207 






355 










360 










365 








208 


Met 


Gly 


Ala 


Leu 


Gly 


Ser 


Phe 


Phe 


Gly 


Ser 


Trp 


Phe 


Val 


Gly 


Tyr 


Leu 


209 




370 










375 










380 










210 


Asn 


Gly 


Thr 


Thr 


Gly 


Ser 


Pro 


Ser 


Ala 


Ser 


Tyr 


lie 


Phe 


Met 


Gly 


Val 


211 


385 










390 










395 










400 


212 


Ala 


Leu 


Phe 


Ala 


Ser 


Val 


Trp 


Leu 


Thr 


Leu 


lie 


Val 


Lys 


Pro 


Ala 


Asn 


213 










405 










410 










415 




214 


Asn 


Gin 


Lys 


Leu 


Pro 


lie 


Gly 


Ala 


Arg 


His 


Ala 












215 








420 










425 





















(1491) 



217 <210> SEQ ID NO: 3 

218 <2 11> LENGTH: 1775 

219 <2 12 > TYPE: DNA 

220 <2 13 > ORGANISM: 

222 <220> FEATURE: 

223 <221> NAME/KEY 

224 <222 > LOCATION 

226 <4 00 > SEQUENCE 

227 ggcaatttgc ggtgtttttt ccgcaggacg ttcatcgtcc ggcctgtatt catcaacggc 

228 cctgcgctat tcgcaaagtg gtggtgaaaa taccgctgcg ttatttaacg cccaataagc 

229 aacaccgagt ttataaccct gaacgacacg gctgcgggcc tgtgtagacg cccctacgcc 

230 ttaacaccac taaatgactc tacaggtgta tat atg aat aca gcc tct gtt tct 

231 Met Asn Thr Ala Ser Val Ser 

232 1 5 



234 


gtc 


acc 


caa 


age 


cag 


geg 


ate 


ccc 


aaa 


tta 


ege 


tgg 


ttg 


aga 


235 


Val 


Thr 


Gin 


Ser 


Gin 


Ala 


lie 


Pro 


Lys 


Leu 


Arg 


Trp 


Leu 


Arg 


236 






10 










15 










20 




238 


ccg 


cct 


att 


ett 


att 


acc 


tgc 


att 


att 


tcc 


tat 


atg 


gac 


egg 


239 


Pro 


Pro 


lie 


Leu 


lie 


Thr 


Cys 


lie 


lie 


Ser 


Tyr 


Met 


Asp 


Arg 


240 




25 










30 










35 
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PATENT APPLICATION: US/10/787,267 



DATE: 07/20/2004 

TIME: 11:34:22 



Input Set : A:\GC687-3-Dl-seqlist.txt 
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242 


ate 


gcc 


ttc 


gcc 


atg 


ccc 


ggc 


ggc 


atg 


gac 


gat 


gaa 


ctg 


ggc 


ate 


acc 


378 


243 


He 


Ala 


Phe 


Ala 


Met 


Pro 


Gly 


Gly 


Met 


Asp 


Asp 


Glu 


Leu 


Gly 


lie 


Thr 




244 


40 










45 










50 










55 




246 


gcc 


teg 


atg 


gcc 


ggg 


ttg 


gcc 


ggc 


ggt 


att 


ttc 


ttt 


ate 


ggt 


tat 


ctg 


426 


247 


Ala 


Ser 


Met 


Ala 


Gly 


Leu 


Ala 


Gly 


Gly 


lie 


Phe 


Phe 


He 


Gly 


Tyr 


Leu 




248 










60 










65 










70 






250 


ttc 


ttg 


cag 


gta 


ccc 


ggc 


ggc 


aag 


ctg 


geg 


gtg 


tac 


ggc 


aac 


ggc 


aag 


474 


251 


Phe 


Leu 


Gin 


Val 


Pro 


Gly 


Gly 


Lys 


Leu 


Ala 


Val 


Tyr 


Gly 


Asn 


Gly 


Lys 




252 








75 










80 










85 








254 


aaa 


ttc 


ate 


ggt 


tgg 


teg 


ttg 


ttg 


gcc 


tgg 


geg 


gtg 


att 


tec 


gtg 


ctg 


522 


255 


Lys 


Phe 


lie 


Gly 


Trp 


Ser 


Leu 


Leu 


Ala 


Trp 


Ala 


Val 


lie 


Ser 


Val 


Leu 




256 






90 










95 










100 










258 


acc 


ggg 


ctg 


gtc 


aeg 


aat 


cag 


tat 


caa 


ttg 


ctg 


ttc 


ctg 


ege 


ttc 


gee 


570 


259 


Thr 


Gly 


Leu 


Val 


Thr 


Asn 


Gin 


Tyr 


Gin 


Leu 


Leu 


Phe 


Leu Arg 


Phe 


Ala 




260 




105 










110 










115 












262 


etc 


ggc 


cgt 


ttc 


ega 


age 


ggc 


atg 


ctg 


egg 


tgg 


gtg 


ctg 


acc 


atg 


ate 


618 


263 


Leu 


Gly 


Arg 


Phe 


Arg 


Ser 


Gly 


Met 


Leu 


Arg 


Trp 


Val 


Leu 


Thr 


Met 


lie 




264 


120 










125 










130 










135 




266 


age 


aac 


tgg 


ttc 


ccg 


gac 


aag 


gaa 


eg c 


ggg 


ege 


gee 


aac 


gcc 


ate 


gtc 


666 


267 


Ser 


Asn 


Trp 


Phe 


Pro 


Asp 


Lys 


Glu 


Arg 


Gly Arg 


Ala 


Asn 


Ala 


lie 


Val 




268 










' 140 










145 










150 






270 


ate 


atg 


ttc 


gtg 


ccg 


ate 


gcc 


ggc 


ate 


ett 


acc 


gca 


ccg 


ctg 


tec 


ggc 


714 


271 


lie 


Met 


Phe 


Val 


Pro 


lie 


Ala 


Gly 


lie 


Leu 


Thr 


Ala 


Pro 


Leu 


Ser 


Gly 




272 








155 










160 








- 


165 








274 


tgg 


ate 


ate 


acc 


gcc 


tgg 


gac 


tgg 


ege 


atg 


ctg 


ttc 


ctg 


gtc 


gag 


ggc 


762 


275 


Trp 


lie 


lie 


Thr 


Ala 


Trp 


Asp 


Trp 


Arg 


Met 


Leu 


Phe 


Leu 


Val 


Glu 


Gly 




276 






170 










175 










180 










278 


geg 


ctg 


teg 


ctg 


gtc 


gtg 


atg 


gtg 


ctg 


tgg 


tat 


ttc 


acc 


ate 


age 


aac 


810 


279 


Ala 


Leu 


Ser 


Leu 


Val 


Val 


Met 


Val 


Leu 


Trp 


Tyr 


Phe 


Thr 


lie 


Ser 


Asn 




280 




185 










190 










195 












282 


cgt 


cca 


caa 


gag 


gcc 


aaa 


agg 


att 


teg 


cag 


geg 


gaa 


aaa 


gat 


tat 


ctg 


858 


283 


Arg 


Pro 


Gin 


Glu 


Ala 


Lys 


Arg 


lie 


Ser 


Gin 


Ala 


Glu 


Lys 


Asp 


Tyr 


Leu 




284 


200 










205 










210 










215 




286 


ate 


aaa 


aeg 


ctg 


cac 


gac 


gaa 


cag 


ttg 


ctg 


ate 


aaa 


ggc 


aaa 


aeg 


gtg 


906 


287 


lie 


Lys 


Thr 


Leu 


His 


Asp 


Glu 


Gin 


Leu 


Leu 


lie 


Lys 


Gly 


Lys 


Thr 


Val 




288 










220 










225 










230 






290 


ege 


aac 


gcc 


teg 


ctg 


cgt 


egg 


gtg 


ctg 


ggc 


gac 


aaa 


ate 


atg 


tgg 


aag 


954 


291 


Arg 


Asn 


Ala 


Ser 


Leu 


Arg 


Arg 


Val 


Leu 


Gly Asp 


Lys 


lie 


Met 


Trp 


Lys 




292 








235 










240 










245 








294 


ttc 


ttc 


tac 


cag 


acc 


ggg 


ata 


tac 


ggc 


tac 


acc 


ctg 


tgg 


ctg 


ccg 


acc 


1002 


295 


Phe 


Phe 


Tyr 


Gin 


Thr 


Gly 


lie 


Tyr 


Gly 


Tyr 


Thr 


Leu 


Trp 


Leu 


Pro 


Thr 




296 






250 










255 










260 










298 


att 


etc 


aag 


ggg 


etc 


acc 


aac 


ggc 


aat 


atg 


gag 


cag 


gtc 


ggg 


atg 


ctg 


1050 


299 


lie 


Leu 


Lys 


Gly 


Leu 


Thr 


Asn 


Gly 


Asn 


Met 


Glu 


Gin 


Val 


Gly 


Met 


Leu 




300 




265 










270 










275 












302 


get 


ate 


ctg 


ccc 


tat 


ate 


ggc 


gcc 


ate 


ttc 


ggc 


atg 


ctg 


ate 


att 


tec 


1098 


303 


Ala 


lie 


Leu 


Pro 


Tyr 


lie 


Gly 


Ala 


lie 


Phe 


Gly 


Met 


Leu 


lie 


lie 


Ser 




304 


280 










285 










290 










295 




306 


acc 


etc 


tcc 


gat 


ege 


acc 


ggc 


aag 


ege 


aaa 


gtg 


ttc 


gtc 


gca 


ctg 


ccg 


1146 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/787,267 

Input Set ; A:\GC687-3-Dl-seqlist.txt 
Output Set: N:\CRF4\07202004\J787267.raw 

: 1 172 M : 258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#: 13 
: 1175 M : 2 58 W : Mandatory Feature missing, <220> Tag not found for SEQ ID#: 13 
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